Parameter-Free Hierarchical Co-clustering by n-Ary Splits
نویسندگان
چکیده
Clustering high-dimensional data is challenging. Classic metrics fail in identifying real similarities between objects. Moreover, the huge number of features makes the cluster interpretation hard. To tackle these problems, several co-clustering approaches have been proposed which try to compute a partition of objects and a partition of features simultaneously. Unfortunately, these approaches identify only a predefined number of flat co-clusters. Instead, it is useful if the clusters are arranged in a hierarchical fashion because the hierarchy provides insides on the clusters. In this paper we propose a novel hierarchical co-clustering, which builds two coupled hierarchies, one on the objects and one on features thus providing insights on both them. Our approach does not require a pre-specified number of clusters, and produces compact hierarchies because it makes n−ary splits, where n is automatically determined. We validate our approach on several high-dimensional datasets with state of the art competitors.
منابع مشابه
K-ary Clustering with Optimal Leaf Ordering for Gene Expression Data
MOTIVATION A major challenge in gene expression analysis is effective data organization and visualization. One of the most popular tools for this task is hierarchical clustering. Hierarchical clustering allows a user to view relationships in scales ranging from single genes to large sets of genes, while at the same time providing a global view of the expression data. However, hierarchical clust...
متن کاملOn Randomly Projected Hierarchical Clustering with Guarantees
1 Hierarchical clustering (HC) algorithms are generally limited to small data instances due to their runtime costs. Here we mitigate this shortcoming and explore fast HC algorithms based on random projections for single (SLC) and average (ALC) linkage clustering as well as for the minimum spanning tree problem (MST). We present a thorough adaptive analysis of our algorithms that improve prior w...
متن کاملABELIAN STATE-CLOSED SUBGROUPS OF AUTOMORPHISMS OF m-ARY TREES
The group Am of automophisms of a one-rooted m-ary tree admits a diagonal monomorphism which we denote by x. Let A be an abelian state-closed (or self-similar) subgroup of Am. We prove that the recurrence and tree-topological closure A∗ of A is additively a finitely presented Zm [[x]]module where Zm is the ring of m-adic integers. Moreover, if A∗ is torsion-free then it is a finitely generated ...
متن کاملNEW TYPES OF FUZZY n-ARY SUBHYPERGROUPS OF AN n-ARY HYPERGROUP
In this paper, the new notions of ``belongingness ($in_{gamma}$)"and ``quasi-coincidence ($q_delta$)" of a fuzzy point with a fuzzyset are introduced. By means of this new idea, the concept of$(alpha,beta)$-fuzzy $n$-ary subhypergroup of an $n$-aryhypergroup is given, where $alpha,betain{in_{gamma}, q_{delta},in_{gamma}wedge q_{delta}, ivq}$, andit is shown that, in 16 kinds of $(alpha,beta...
متن کاملVisualization, Search and Analysis of Hierarchical Translation Equivalence in Machine Translation Data
Translation equivalence constitutes the basis of all Machine Translation systems including the recent hierarchical and syntax-based systems. For hierarchical MT research it is important to have a tool that supports the qualitative and quantitative analysis of hierarchical translation equivalence relations extracted from word alignments in data. In this paper we present such a toolkit and exempl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009